EVALUATION OF SPECTRAL SUBTRACTION WITH ON THE AURORA 2 Norihide Kitaoka and Seiich
نویسنده
چکیده
To reduce the effects of additive noises, spectral subtraction (SS) is often used. We discuss SS on the power spectral domain. This method has two problems to make the estimation of clean speech difficult:(1) There exists the estimation error between true power spectrum of noise and estimated one (2) The correlation between speech and noise also exists because of the phase difference. To overcome these problems, we proposed a spectral subtraction using a smoothing method of time direction. We consider the average of estimated speech power spectra over some frames as the estimated speech power spectrum. This operation makes the estimation of noise more accurate. We can reduce the effect of correlation between speech and noise. In this paper, we tested this method on the AURORA 2 database, which consists of English connected digit added with various realistic noises. We achieved 47.26% relative improvement of word accuracy with acoustic models trained under clean condition and 11.95% with models trained under multi-condition.
منابع مشابه
Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs
To achieve high recognition performance for a wide variety of noise and for a wide range of signal-to-noise ratio, this paper presents integration methods of four noise reduction algorithms: spectral subtraction with smoothing of time direction, temporal domain SVD-based speech enhancement, GMM-based speech estimation and KLT-based comb-filtering. In this paper, we proposed two types of combina...
متن کاملRobust speech recognition under noisy environments based on selection of multiple noise suppression methods
To achieve high recognition performance for a wide variety of noise and for a wide range of signal-to-noise ratio, this paper presents the integration of four noise reduction algorithms: spectral subtraction with smoothing of time direction, temporal domain SVD-based speech enhancement, GMM-based speech estimation and KLT-based comb-filtering. In this paper, we investigated the optimal suppress...
متن کاملAURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition
This paper introduces an evaluation framework for Japanese noisy speech recognition named AURORA-2J. Speech recognition systems must still be improved to be robust to noisy environments, but this improvement requires development of the standard evaluation corpus and assessment technologies. Recently, the Aurora 2, 3 and 4 corpora and their evaluation scenarios have had significant impact on noi...
متن کاملDereverberation Based on Spectral Subtraction by Multi-Channel LMS Algorithm for Hands-Free Speech Recognition
In a distant-talking environment, channel distortion drastically degrades speech recognition performance because of a mismatch between the training and testing environments. The current approach focusing on automatic speech recognition (ASR) robustness to reverberation and noise can be classified as speech signal processing [1, 4, 5, 14], robust feature extraction [10, 20], and model adaptation...
متن کاملDistant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm
We propose a blind dereverberation method based on spectral subtraction using a multi-channel least mean squares (MCLMS) algorithm for distant-talking speech recognition. In a distant-talking environment, the channel impulse response is longer than the short-term spectral analysis window. By treating the late reverberation as additive noise, a noise reduction technique based on spectral subtrac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002